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SPECIFICATION 

Electronic Version 1.2.8 
Stylesheet Version 1 .0 

SYSTEM AND METHOD FOR 
PREDICTING ADDITIONAL 

SEARCH RESULTS OF A 
COMPUTERIZED DATABASE 
SEARCH USER BASED ON AN 
INITIAL SEARCH QUERY 

Background of the Invention 

[0001] The present invention relates generally to search query prediction. More 

particularly the present invention relates to predicting subsequent queries and 
retrieving the search results from the predicted queries based upon an initial search 
query utilizing a conventional search engine. 

[0002] Electronic searching for information contained in a database. is typically conducted 
by entering a query term or phrase into a search engine. Typically, the search query 
returns document results that are based on relevancy to the queried search term(s). 
With the growing use of the Internet and database technologies, query based 
searching has grown beyond simply searching databases contained locally within an 
organization, to searching file systems and external databases over the Internet due 
to its vast resources and wealth of information. The term document, as used in this 
application encompasses files, records, pages, Internet sites, data entries, or any 
other terminology used to identify a unit of a database. 

[0003] 

Computer users searching for particular documents conventionally use a typical 
search engine function, such as Microsoft's Indexing Service Query or the like. The 
user typically starts his search by entering a query, comprising at least one term, 
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inside the search engine input box. The term generally consists of an anticipated 
keyword or string of search terms, which the search engine then uses to search its 
database and generate a return list of documents containing such keyword(s) or are 
pre-designated as being related to such keyword(s). Preferably, only documents that 
are relevant to the queried search term are returned in the return list to avoid 
confusing and overwhelming the user with numerous irrelevant results. However, as is 
typically confronted using conventional search engines, many documents may be 
relevant to the queried term(s) and thus a result of thousands, if not millions, of 
documents may be provided in the results list, thus frustrating the user's task by 
requiring a manual review of each result returned to find the desired document. 
Additionally, the user may inadvertently select the incorrect term used within the 
search query and the desired document list is not returned. 

[0004] To combat this problem, some existing search engines have implemented 

refinement processes that attempt to limit or filter the user's search in order to narrow 
the amount of documents returned in the return list. An typical existing refinement 
process works by suggesting related query terms directly to the user based on the 
user's previous queried term(s). This process, generally called frequency searching, 
uses previous searches of the existing search user as a method of refinement by 
comparing the frequency the user has previously used certain terms in conjunction 
with other terms during a previous search. The process then attempts to provide 
refined search words by recommending combinations or related terms based upon 
that comparison. As such, this process is wholly dependent upon the frequency that 
exact search terms have historically been submitted. However, as anticipated, a 
limitation of this process is that it is confined to search terms previously used by the 
search user and thus does not have the capability to accurately and efficiently predict 
keywords that can be utilized. The present invention overcomes this limitation by not 
being dependent upon the terms previously utilized by the user. 



Another existing search engine refinement process is the use of not only the 
existing user's previously used terms, but the frequency of used terms based upon a 
group of searchers. This group can be as enormous as all Internet users or as small as 
a local area network. This process utilizes the same comparison technique as 
described above but has enhanced its abilities by enlarging its comparable terms. 
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Again, however, a limitation of this process is that the refined search terms are 
dependent upon the terminology of other finite searchers. The present invention 
overcomes this limitation by not being dependent upon the terminology used by other 
searchers. 

[0006] Yet another search engine refinement process is determining the frequency of 
occurrence of a search term within each of the documents that are generated by the 
initial search. After the search engine identifies the most frequently used term(s) 
within a document it presents these terms to the user as feasibly "related" terms for 
the user to use in order to refine his search. A limitation of this process is that the 
related terms are confined to the finite verbiage and term usage contained within the 
resultant documents. The present invention overcomes this limitation by not being 
confined to terms contained within a resultant document. 

[0007] Yet, there is missing in the art a reliable and accurate method of predicting the 
search user's additional needs, based upon the initial search query. For example, the 
return lists generated by the search engines of the existing processes do not 
anticipate potential needs of the searcher but rather produce results based either on 
the initial queried search term(s) or the terminology utilized by other searchers and 
are thus rather limited in their scope and applicability. The present invention 
overcomes these limitations by, for example, anticipating what the search user may 
need by relating the search term to a plurality of pre-defined categories, wherein each 
category has a number of documents associated with it. 

[0008] The present invention overcomes the disadvantages and/or shortcomings of 
known prior art search result providers and provides a significant improvement 
thereover. 

Brief Summary of the Invention 

[0009] It is therefore an object of the present invention to provide a system and method 
for efficiently and accurately predicting additional search needs of the user based on 
an initial search query executed through a conventional search engine. 

A further object of the present invention is to provide a system and method that 
reduces the time a user spends on searching for documents by providing a return list 
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of documents in real time, that in a single command simultaneously produces a return 
list that is comprised of directly relevant results and results that predict other needs 
of the searcher. 

[001 1] A still further object of the present invention is to provide a system and method to 
all platforms utilizing or benefiting from existing search engine functionality. 

[0012] Yet another object of the present invention is to provide a system and method that 
works in conjunction with return lists generated by conventional processes or as a 
uniquely separate return list on the specified platform. 

[001 3] It is also an object of the current invention to provide users with a predictive 

results list that will be arranged according to a document or category that can be used 
to enhance, compliment, strengthen, safeguard, or be used as an accessory to the 
original document or topical category listings selected by a service provider. 

[0014] The present invention is a unique and novel process for conducting queried 

searches through a conventional search engine by providing a system and method for 
reliably and accurately predicting additional search results and needs based on an 
initial search query. 

[001 5] In the preferred embodiment of the present invention, a user enters a search term 
or string of search terms in a conventional search engine input box. Once the user 
submits the initial search query, the search engine produces a list of predicted results 
based upon that initial search term. 

[0016] -|- ne preferred embodiment of the present invention maintains at least one 

database containing predefined categories. Each category is created, defined and 
maintained by a system provider or other 3rd party. Each respective category is 
preferably inter-related and cross-referenced to other independent categories either 
based upon their topical contents or other method. Once the categories are defined, 
anticipated search terms in a search term database are matched to at least one 
category based on definitional relevancy and/or linguistic usage and more likely they 
are matched to a plurality of categories. Furthermore, each document maintained in 
the document database is also matched to at least one predefined category based 
upon the approximated relationship between the content of the documents and/or the 
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predefined descriptions of each respective category. 

[001 7] Once the user submits an initial search query, only one predefined category is 

selected based upon the relevancy of the search term to the category and the search 
term's pre-designation to that category. The present invention selects the most 
popular category out of all the categories that the search term may belong to by 
implementing a set of preponderance criteria that calculates the number of times a 
particular category is selected by searchers in association with each search term used 
in the initial search, uses a subjective determination made by the editorial department 
as to which search terms belong to which categories, and/or calculates the number of 
times a search term is repeated within the pre-designated keywords contained within 
documents associated with the category. 

[001 8] When the initial category is determined and selected, the process of matching this 
category to other related or cross-referenced categories begins. The initial category 
for the search term is subsequently matched to other predetermined topical 
categories that enhance, compliment, or append the initial category, and thus 
approximates and predicts additional needs of the user. 

[001 9] After the user submits his or her initial query, a search result list comprising of 

documents belonging to associated categories is simultaneously displayed, preferably 
arranged by categories. Alternately, the categories can be arranged statistically, with 
the most relevant category appearing first followed by the next most relevant and so 
on. Still alternately, the search service provider can arrange the categories according 
to a subjective determination. Monetary information can also be used in the area of 
"Pay-Per-Click" search engines to determine the order that the categories appear. 
Furthermore, any known relevancy criteria can be used including the preponderance 
methodology described in this application to order the list of categorical results. Once 
the categories are ordered, documents are then automatically chosen to appear 
beneath each category. Documents can be arranged using the same set of criteria 
listed above for categorical ordering. 

[0020] Preferably, the predictive search results will be displayed simultaneously with the 

directly relevant search results. This can be accomplished by simultaneously using 
multiple display devices, or by using a single device with bifurcation functionality. It 
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should also be noted that the predictive search results can be used independently and 
be the only return result list. 

Brief Description of the Several Views of the Drawings 

[0021] The preferred embodiment is herein described in detail with references to the 
drawings, where appropriate, wherein: 

[0022] FIG. 1 is a flowchart depicting the preferred embodiment's method of categorizing 
documents contained within a database; 

[0023] FIG. 2 is a flowchart depicting an existing typical document and search result 
retrieval system; 

[0024] FIG. 3 is a flowchart depicting the preferred embodiment's method of predicting 
search results; 

[0025] FIG. 4 is a flowchart depicting the preferred embodiment's method of matching 
keywords to categories utilizing the preferred preponderance method; 

[0026] FIG. 5 is a flowchart depicting the preferred embodiment's method of determining 
associated categories of the initial category; and 

[0027] FIG. 6 is an example of a specific embodiment of the present invention as it 
relates to an online Pay-Per-Click search engine. 

Detailed Description of the Invention 

[0028] 

The preferred embodiment of the present invention utilizes a method of assigning 
anticipated search terms and documents to pre-defined topical categories in order to 
accurately and reliably predict a search user's needs. To achieve this result, the 
preferred embodiment of the present invention provides a topical category database 
with at least one pre-defined topical category. The topical categories are preferably 
defined with a title and a description, somewhat similar to encyclopedic topics. 
Alternately, the topical categories can be defined with other cataloging references, 
such as a numeric cataloging system, computer cataloging system, and the like. Each 
respective topical category is preferably related and cross-referenced with at least one 
other category based upon their respective topics, subjectively determined to be 
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related, relevant, or adjunct. 

[0029] The preferred embodiment of the present invention also provides a database of 
documents with information for at least one document. Preferably, the document 
database contains information relative to a respective document, such as topic, title, 
content, author, description, and location. Referring to FIG. 1 , the preferred 
embodiment of the present invention utilizes a subjective determination of 
systematically assigning a document 1 contained within the present invention's 
document database to at least one pre-defined topical category in the topical category 
database utilizing a preferred method where a document 1 is dissected into 4 
subparts; a description la, a title lb, content lc, and meta-tags Id. The sub-parts 
are used by the search service provider to subjectively evaluate the document 2 and 
compare the components of the document to the topical category list 3 contained 
within the topical category database to assign each document to the most relevant 
and appropriate topical category 4. Alternately, the present invention can categorize a 
document 1 utilizing any combination of the document's 1 description la, title lb, 
content lc, or meta-tags Id. Still alternately, a document can be assigned to a 
category by using any sub-part exclusively. Preferably, each document is assigned to 
at least one topical category. Alternately, documents are assigned to a plurality of 
topical categories. 

[0030] The preferred embodiment of the present invention also provides a database of 
anticipated search terms having at least one anticipated search term. The anticipated 
search terms contained within the anticipated search term database are also assigned 
to at least one of the pre-defined topical categories contained within the topical 
category database, as described above, based upon their respective definitions and/or 
common language usages, thus creating an anticipated search term assignment. As 
such, anticipated search terms and documents are assigned to common pre-defined 
topical categories contained within the same topical category database. 

[0031] 

Referring to FIGs. 3 and 4, the preferred embodiment of the present invention 
begins when a search user sends at least one desired initial search term 5 to the 
present invention via a conventional search term input box. The initial search term 
preferably contains at least one word the search user desires to search. Alternately, 
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the initial search term can contain a string of words. After receiving the initial search 
term, the present invention finds the most popular topical category 9 assigned to that 
initial search term. This step is accomplished by accessing the anticipated search term 
database, matching the initial search term to at least one anticipated search term 
within the anticipated search term database, and associating the initial search term 
with the pre-defined topical category(ies) assigned to the matched anticipated search 
term. 

If more than one pre-defined category is assigned to the initial search term, then 
the preferred embodiment of the present invention utilizes a preponderant method 1 0 
and selects the most popular topical category for that initial search term. The 
preponderant method preferably finds the most popular topical category 9, either in 
combination or exclusivity, by calculating the number of times a particular category is 
selected by searchers in association with each search term used in the initial search, 
termed popular "searcher" category choice 1 5, calculating the number of times a 
search term is repeated within the pre-designated keywords contained within 
documents associated with the category, termed highest frequency category choice 
16, or a subjective determination made by the service provider to automatically assign 
a most popular category 1 7. 

After assigning a most popular topical category 9 to the initial search term 5, the 
preferred embodiment of the present invention finds any associated categories of the 
initial search term 1 1 by associating the initial search term 5 with topical categories 
designated as inter-related and cross-referenced with the most popular topical 
category 9 assigned to the initial search term 5 and any other assigned categories of 
the initial search term. 

[0034] The present invention then organizes each associated category 1 2. The preferred 
embodiment organizes each successive associated category based upon the relevancy 
of the initial search term and the associated category. Alternately, successive 
associated categories may be organized statistically, most accessed, most documents 
assigned to the respective category, and the like. 

[0035] Tne p resen t invention then returns the document results for the requested initial 
search term 1 3 based upon the keywords of the respective documents matching the 
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initial search term. Preferably, the most popular category search results are listed first, 
by listing all documents assigned to the most popular category. It is anticipated that 
the most popular category assigned to the initial search result will contain the 
information that the search user was initially searching for. The present invention also 
displays the resultant documents assigned to the associated categories of the initial 
search result, which attributes the predictive nature of the present invention. 
Preferably the associated category documents are displayed contemporaneously with 
the document results of the most popular category. Alternately, the associated 
category documents are displayed by utilizing a split-screen configuration on a single 
display device or by providing a link to a new result page list. Furthermore, the 
document results are preferably in a form of hypertext links to the designated location 
of the document. Alternately, the actual documents themselves can be displayed. 

[0036] The present invention can be better illustrated with the following example which is 
intended to explain, and not limit, the invention. Referring to FIG. 6, an example 
embodiment of the present invention reflecting the use of the present invention in an 
online Internet search engine setting, where a searcher enters an initial search term 5 
"hotel" in the search input box, submits the search to the search engine provider, and 
a conventional split-column return list is displayed containing a most popular 
category documents results list 21 and an associated category documents result list 
22, where each associated category 12 and document information 2 3 are listed in 
ranked order as determined by the subjective/objective criteria selected by search 
engine provider (in this example, the criteria was the bid amount). 

[0037] While preferred and alternate embodiments have been described herein, it is to be 
understood that these descriptions are only illustrative and are thus exemplifications 
of the present invention and shall not be construed as limiting. It is to be expected 
that others will contemplate differences, which, while different from the foregoing 
description, do not depart from the true spirit and scope of the present invention 
herein described and claimed. 
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